INTEGRATED AUDIO/VIDEO SENSOR

CROSS-REFERENCE TO RELATED APPLICATION 
This application claims the priority benefit of Taiwan application serial no.
92129226, filed on October 22, 2003.

BACKGROUND OF THE INVENTION 

Field of the Invention 

[0001] The present invention relates to an audio and video sensing device.
More particularly, the present invention relates to a compact and inexpensive integrated
audio/video sensor and system capable of receiving image and sound signals
synchronously and resolving the compatibility problem between image-sensing and
sound-sensing circuits.

Description of the Related Art

[0002] In modern society, consumer electronics products play an increasingly
important role in our daily life. To be popular, these electronic products must be light
and compact, have a low operating voltage and consume as little power as possible. In
recent years, most consumer products have aimed for portability. In other words,
inventors are seeking ways to integrate formerly individual devices into a smaller unit
so that the integrated product is lighter and easier to carry. For example, the
integration of a personal computer with a real-time video information system has
permitted instant video communication through a network. Similarly, the integration
of a mobile phone with a real-time video information system has permitted instant video
communication through a wireless telephone network. Another example is the
integration of audio and video sensors into a single unit.

[0003] In the past, the audio sensor and the video sensor were formed separately.
FIG. 1 is a schematic cross-sectional view of a conventional video sensor. As shown
in FIG. 1, a conventional video sensor 100 comprises a lens 102 on a lens mount 104, a
holder 106, an infrared filter plate 108, an image-sensing chip 110, a passive component
112 and an image processor 114. The lens 102 receives light from an image, passes
the light through the infrared filter plate 108 and finally focuses the light on the
image-sensing chip 110. The image-sensing chip 110 transforms the received light into
image signals. Thereafter, the image signals are transmitted through the peripheral
circuits, for example, the passive component 112, to the image processor 114 for further
processing. In general, if the aforementioned image-sensing device is a
complementary metal-oxide-semiconductor (CMOS) image-sensing module, the image sensor
costs less to produce and is smaller overall than one based on a conventional
charge-coupled device (CCD).

[0004] FIG. 2 is a schematic sectional view of a conventional audio sensor. As
shown in FIG. 2, a conventional audio sensor comprises a polymer diaphragm 202, a
spacer pad 204, a back plate 206, a media substance 208, an impedance converter 210
and a power source/peripheral circuit 212. In the past, carbon powder was inserted
into the space between the polymer diaphragm 202 and the back plate 206. In recent
years, the carbon powder has been replaced by air. When the diaphragm 202 receives
an acoustic vibration, the carbon powder between the two electrodes is compressed
or relaxed, so that the impedance between the polymer diaphragm 202 and the back
plate 206 changes. The impedance is measured and converted by the impedance
converter 210. The drawback of these kinds of products is their high noise content and
relative instability.

[0005] Accordingly, the mechanical structure and the electronic circuits of a
conventional image sensor differ from those of a conventional audio sensor.
Hence, in most conventional audio/video sensing systems, the audio sensor and
the video sensor are simply assembled together for convenience. In such an audio/video
sensing system, the sensing devices and circuits for audio sensing and image sensing are
independent of each other. Therefore, a lot of space may be wasted and the cost is
high. Furthermore, the images and sounds collected via the different sensing devices
are not synchronous because of compatibility problems between the sensing circuits.
Thus, an integrated audio/video sensor is required.

SUMMARY OF THE INVENTION 
[0006] Accordingly, one object of the present invention is to provide an
integrated audio/video sensor that occupies a smaller space, costs less to produce and
has the capability to synchronize collected video and audio signals.

[0007] To achieve these and other advantages and in accordance with the
purpose of the invention, as embodied and broadly described herein, the invention
provides an integrated audio/video sensor. The integrated audio/video sensor
comprises an image-receiving module, a sound-receiving module and a
signal-transforming module. The image-receiving module receives images. The
sound-receiving module receives sound. The signal-transforming module receives the
received images and the received sound and integrates the images and sound into
audio/video signals.

[0008] This invention also provides an integrated audio/video processing system
comprising an integrated audio/video sensor and an audio/video system. The
integrated audio/video sensor comprises an image-receiving module, a sound-receiving
module and a signal-transforming module. The image-receiving module receives
images. The sound-receiving module receives sound. The signal-transforming
module receives the images and the sound and integrates the image and sound signals
into audio/video signals. The audio/video system performs a post-processing of the
audio/video signals.

[0009] In the aforementioned integrated audio/video sensor, the
signal-transforming module further comprises an image-sensing chip, an audio amplifier, an
audio/video processing chip and a peripheral circuit chip. The image-sensing chip senses
images and outputs the audio/video signals. The audio amplifier senses and amplifies
sound and outputs the audio/video signals. The audio/video processing chip performs
a post-processing of the audio/video signals. The peripheral circuit chip is a device that
accommodates the circuits of other functional components.

[0010] In the aforementioned integrated audio/video sensor, the image-sensing
chip can be a complementary metal-oxide-semiconductor (CMOS) image-sensing
module or a charge-coupled device (CCD).

[0011] In the aforementioned integrated audio/video sensor, the method of
fabricating the signal-transforming module includes a multi-chip module (MCM) or a
system on a chip (SOC) fabrication method.

[0012] In the aforementioned integrated audio/video sensor, the
signal-transforming module has the capability to receive image and sound
simultaneously and to synchronize them.

[0013] In the aforementioned integrated audio/video sensor, the audio/video
signal comprises an image signal and an audio signal.

[0014] In the aforementioned integrated audio/video sensor, the sound-receiving
module further comprises a condenser microphone.

[0015] In brief, this invention utilizes, for example, a multi-chip module (MCM)
or a system on a chip (SOC) technology to integrate the image-sensing module, the
sound-sensing module and the signal-transforming module together into an integrated
audio/video sensor. With this arrangement, both the space occupied by the sensor and
its fabrication cost are significantly reduced. Furthermore, the
compatibility problem between the video sensor and the audio sensor is resolved so that
video and audio signals can be collected synchronously.

[0016] It is to be understood that both the foregoing general description and the 
following detailed description are exemplary, and are intended to provide further 
explanation of the invention as claimed. 

BRIEF DESCRIPTION OF THE DRAWINGS 
[0017] The accompanying drawings are included to provide a further
understanding of the invention, and are incorporated in and constitute a part of this
specification. The drawings illustrate embodiments of the invention and, together with
the description, serve to explain the principles of the invention.

[0018] FIG. 1 is a schematic cross-sectional view of a conventional video sensor.

[0019] FIG. 2 is a schematic cross-sectional view of a conventional audio sensor.

[0020] FIG. 3 is a schematic cross-sectional view of an integrated audio/video
sensor according to one preferred embodiment of this invention.

[0021] FIG. 4 is a schematic cross-sectional view illustrating the basic structure
of a condenser microphone according to this invention.

[0022] FIG. 5 is a partial perspective view illustrating the basic structure of a
sound-sensing module according to this invention.

DESCRIPTION OF THE PREFERRED EMBODIMENTS 
[0023] Reference will now be made in detail to the present preferred
embodiments of the invention, examples of which are illustrated in the accompanying
drawings. Wherever possible, the same reference numbers are used in the drawings
and the description to refer to the same or like parts.

[0024] FIG. 3 is a schematic cross-sectional view of an integrated audio/video
sensor according to one preferred embodiment of this invention. Referring to FIG. 3,
an integrated audio/video sensor comprising an image sensor and a condenser
microphone is shown. FIG. 4 is a schematic cross-sectional view illustrating the basic
structure of a condenser microphone according to this invention. FIG. 5 is a partial
perspective view illustrating the basic structure of a sound-sensing module according to
this invention.

[0025] As shown in FIG. 4, the condenser microphone 400 comprises an outer
cavity casing 402, a fixed inner cavity casing 406, an insulator 410 and an impedance
converter 412. The outer cavity casing 402 has a membrane 404 thereon and the fixed
inner cavity casing 406 has an air cavity 408 therein. The insulator 410 is disposed
between the outer cavity casing 402 and the fixed inner cavity casing 406. The
membrane 404 is a permanently charged electric material that provides a permanent
polarized voltage. The membrane 404, combined with the fixed inner cavity casing
406 enclosing the air cavity 408, constitutes a capacitor. For this type of capacitor, the
value of the capacitance can be estimated using formula (1), which approximates the
structure as a parallel-plate capacitor:

$$ C = \frac{Q}{V} = \frac{\varepsilon_r A}{D} \tag{1} $$

where C is the capacitance, Q is the amount of electric charge, V is the voltage
between the membrane 404 and the fixed inner cavity casing 406, A is the area of the
membrane 404, D is the distance between the membrane 404 and the fixed inner cavity
casing 406, and $\varepsilon_r$ is the dielectric constant of the dielectric material. If the
dielectric material is air, $\varepsilon_r = 1$. When sound waves are received by the
membrane 404, the membrane 404 vibrates and causes the distance D to change. Through
the variation of distance D with time, an induced current I is produced according to
formula (2) listed below:

$$ I = \frac{dQ}{dt} = \frac{d}{dt}\left(\frac{\varepsilon_r A V}{D}\right) = \varepsilon_r A V \, \frac{d(1/D)}{dt} \tag{2} $$

where I is the induced current, Q is the amount of electric charge, and d(1/D)/dt is
the rate of change of the inverse distance 1/D with time t. The induced current I is
electrically coupled to a field effect transistor (FET) converter so that the induced
current I is converted from a high impedance signal to a low impedance output signal.
Alternatively, the induced current I can be directly transferred to an audio amplifier for
further processing. Since this type of condenser microphone has a very thin vibration
membrane and directly outputs the voltage according to the acoustic pressure, the
frequency response has a very broad and flat range. The actual frequency response
may range from an ultra low frequency of a few Hz to an ultrasonic frequency of several
tens of kHz. In other words, the condenser microphone has a high sensitivity, a low
self-induced noise and a low signal distortion. However, one of the most important
aspects is the incorporation of an air cavity 408 within the microphone, so that air serves
as the dielectric medium instead of the carbon powder of a conventional microphone.
With air as a dielectric medium, the microphone can be easily integrated with an image
sensor.
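
Formulas (1) and (2) can be explored numerically. The following is a minimal sketch, not part of the described embodiment: it assumes SI units (so the vacuum permittivity, which formula (1) does not write out, appears explicitly as `EPS0`), a constant voltage V as in formula (2), and a sinusoidal gap D(t) = D0 + a sin(2πft). The function names and all numeric values are hypothetical and chosen only for illustration.

```python
import math

EPS0 = 8.854e-12  # vacuum permittivity in F/m; SI factor assumed here, not written in formula (1)


def capacitance(area_m2, gap_m, eps_r=1.0):
    """Parallel-plate estimate of formula (1) in SI units: C = eps_r * eps0 * A / D."""
    return eps_r * EPS0 * area_m2 / gap_m


def induced_current(area_m2, voltage_v, gap0_m, amp_m, freq_hz, t, eps_r=1.0):
    """Formula (2) for an assumed gap D(t) = gap0 + amp * sin(2*pi*f*t).

    I = eps * A * V * d(1/D)/dt, where d(1/D)/dt = -D'(t) / D(t)**2.
    """
    omega = 2.0 * math.pi * freq_hz
    gap = gap0_m + amp_m * math.sin(omega * t)
    gap_rate = amp_m * omega * math.cos(omega * t)  # D'(t)
    return eps_r * EPS0 * area_m2 * voltage_v * (-gap_rate / gap ** 2)


if __name__ == "__main__":
    # Hypothetical geometry: 3 mm membrane radius, 25 um air gap, 0.1 um vibration amplitude.
    area = math.pi * (3e-3) ** 2
    print("C at rest (F):", capacitance(area, 25e-6))
    print("I at t = 0.13 ms (A):", induced_current(area, 100.0, 25e-6, 1e-7, 1000.0, 1.3e-4))
```

The sign convention follows formula (2): while the gap is widening, the capacitance falls and the computed current is negative.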

[0026] As shown in FIG. 3, an image sensor and a condenser microphone are
integrated together to form an integrated audio/video sensor. The integrated
audio/video sensor comprises an image-receiving module 302, a sound-receiving
module 304, a signal-transforming module 306 and a substrate 308. The
image-receiving module 302 further comprises a lens 312 on a lens mount 314, a holder 316
and an infrared filter plate 318. The sound-receiving module 304 further comprises an
outer cavity casing 322 with a membrane 324 thereon, a fixed inner cavity casing 326
and an electrical insulator 330 between the outer cavity casing 322 and the fixed inner
cavity casing 326. The signal-transforming module 306 further comprises an
image-sensing chip 332, an audio/video processor chip 334, an audio amplifier chip 336 and a
peripheral circuit chip 338.

[0027] Referring to FIG. 3, light from an image is received by the lens 312 within
the image-receiving module 302 and passes to the infrared filter plate 318. The
portion of the light whose wavelength lies in or beyond the infrared range is
filtered out by the infrared filter plate 318, and the remaining light is then focused onto
the image-sensing chip 332. Finally, the light of the image received by the image-sensing
chip 332 is transformed into an image signal. In addition, the image signal can also be
transmitted to the audio/video processor chip 334 for further processing. In this
embodiment, the image-sensing chip includes, for example, but is not limited to, a
charge-coupled device (CCD) or a complementary metal-oxide-semiconductor (CMOS)
image-sensing module.

[0028] The membrane 324 within the sound-receiving module 304 is a
permanently charged electric material that provides a permanent electrode voltage.
The membrane 324 and the fixed inner cavity casing 326 with an air cavity 328 constitute
the two electrodes of a capacitor. If the air cavity 328 is filled with air, the dielectric
constant is approximately 1. When sound waves are received by the membrane 324,
the membrane 324 vibrates and the capacitance of the capacitor changes, generating
a voltage variation and an induced current. The output voltage or induced current is
transmitted into the audio amplifier for further processing and finally converted to an
audio signal. The audio signal is transmitted into the audio/video processor chip 334
to perform the post-processing operations. Thus, the sound-receiving module 304 of
this embodiment has the advantages of a condenser microphone. Furthermore,
because the vibrating membrane is extremely thin, the voltage difference and the
induced current generated from acoustic pressure can be output directly, without the
high noise of the prior art. Hence, the actual frequency response of the condenser
microphone may range from an ultra low frequency of a few Hz to an ultrasonic
frequency of several tens of kHz. In other words, the sound-receiving module has a
high sensitivity, a low self-induced noise and a low signal distortion.
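
The voltage variation mentioned above can be made explicit with a short worked step based on formula (1). This is a sketch under an added assumption that the text does not state: the charge Q on the permanently charged membrane is taken as constant while the gap D changes by a small amount ΔD, with ε_r denoting the dielectric constant and A the membrane area.

```latex
V = \frac{Q}{C} = \frac{Q\,D}{\varepsilon_r A}
\quad\Longrightarrow\quad
\Delta V = \frac{Q}{\varepsilon_r A}\,\Delta D
```

Under that assumption the voltage variation is directly proportional to the membrane displacement, which is consistent with the statement that the output follows the acoustic pressure.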

[0029] As shown in FIG. 5, the sound-receiving module includes, for example,
but is not limited to, an outer cavity casing 322, a membrane 324, a fixed inner cavity
casing 326, an air cavity 328 and an electrical insulator 330. The electrical insulator
330 electrically isolates the outer cavity casing 322 from the fixed inner cavity casing
326. In general, a condenser microphone has a cylindrical shape with the membrane
324 located on an upper cross-sectional surface 352 or a lower cross-sectional surface
354 of the cylinder. In this invention, however, an image-receiving module 302 is
installed on the upper surface 352 and a signal-transforming module 306 is installed on
the lower surface 354. Therefore, the sound-receiving membrane 324 is set up on one
side of the cylinder. The arrangement of the membrane, the image-receiving
module and the signal-transforming module is not limited to the embodiments described
above. For example, the signal-transforming module 306 may be positioned between
the upper and the lower surface of the cylinder so that the acoustic membrane can be set
on the lower surface.

FILE: 11 670USF.RTF

[0030] The image-sensing chip 332 inside the signal-transforming module 306 is
used to detect and output video signals. The audio amplifier chip 336 is provided for
detecting, amplifying and outputting audio signals. The audio/video processor chip
334 performs a post-processing of the image signals and the audio signals. The
peripheral circuit chip 338 is a device for providing circuits to other functional
components. The substrate 308 can be a flexible printed circuit (FPC), for example.

[0031] In this invention, a multi-chip module (MCM) or a system on a chip
(SOC) technique can be used to integrate all the chips 332, 334, 336 and 338
together on the substrate 308. After the integration, not only is the signal-transforming
module 306 more compact, but the compatibility issue between the image and
audio sensors is also resolved. Hence, both the image signals and the acoustic signals can
be collected synchronously. If the substrate 308 is fabricated using a flexible printed
circuit board, the sensor can be made even more compact.
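
The text does not specify how the signal-transforming module keeps the two signals synchronous. As one possible illustration only, the sketch below assumes that image frames and audio blocks are stamped against a single shared clock and then merged into one audio/video stream in timestamp order. The `Packet` type, the helper names, and the frame rate, sample rate and block size used in the example are all hypothetical.

```python
import heapq
from dataclasses import dataclass


@dataclass(frozen=True)
class Packet:
    timestamp_us: int  # time on the assumed shared clock, in microseconds
    kind: str          # "video" or "audio"
    payload: bytes


def video_packets(n_frames, fps):
    """Yield image frames stamped at a fixed frame period on the shared clock."""
    period_us = 1_000_000 // fps
    for i in range(n_frames):
        yield Packet(i * period_us, "video", b"")


def audio_packets(n_blocks, sample_rate_hz, block_size):
    """Yield audio blocks stamped by their first sample's time on the same clock."""
    for i in range(n_blocks):
        yield Packet(i * block_size * 1_000_000 // sample_rate_hz, "audio", b"")


def interleave(video, audio):
    """Merge two timestamp-ordered streams into one stream ordered by shared-clock time."""
    return heapq.merge(video, audio, key=lambda p: p.timestamp_us)


if __name__ == "__main__":
    for packet in interleave(video_packets(3, 25), audio_packets(6, 8000, 160)):
        print(packet.timestamp_us, packet.kind)
```

Because both generators derive their timestamps from the same time base, a downstream audio/video system can align sound with the matching frame without estimating the offset between two independent circuits.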

[0032] Accordingly, this invention utilizes the multi-chip module (MCM) or the
system on a chip (SOC) technique to integrate an image-sensing module, a
sound-sensing module and a signal-transforming module together into an integrated
audio/video sensor. Such integration not only saves a lot of space but also makes it
possible to synchronize the video and audio signals.

[0033] It will be apparent to those skilled in the art that various modifications
and variations can be made to the structure of the present invention without departing
from the scope or spirit of the invention. In view of the foregoing, it is intended that
the present invention cover modifications and variations of this invention provided they
fall within the scope of the following claims and their equivalents.